Fix issue with async_scheduling when dealing with chunked input #359

tianmu-li · 2025-10-08T18:25:03Z

When dealing with chunked prompt (input sequence length > max_num_batched_tokens), sometimes there is no output token due to the chunked prompt, but the scheduler expects one. This PR addresses this issue and aligns the behavior with gpu_model_runner.

github-actions · 2025-10-08T18:25:14Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

Signed-off-by: Tianmu Li <[email protected]>

github-actions · 2025-10-08T18:32:54Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

Signed-off-by: Tianmu Li <[email protected]>

tianmu-li · 2025-10-09T01:39:31Z

@afierka-intel Can you check if this fixes the issue with long context? It works from my tests.

afierka-intel · 2025-10-09T06:21:51Z

/run-gaudi-tests

michalkuligowski · 2025-10-09T07:05:48Z

/run-gaudi-tests

michalkuligowski

Please cherrypick to 0.11 also

michalkuligowski · 2025-10-09T09:12:13Z

/run-gaudi-tests

github-actions · 2025-10-09T12:00:34Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
da3fa78dc98f3001e5fb703729a77311146e0cd3

michalkuligowski · 2025-10-10T07:48:03Z

/run-gaudi-tests

michalkuligowski · 2025-10-13T05:16:25Z

/run-gaudi-tests

github-actions · 2025-10-13T07:28:26Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
da3fa78dc98f3001e5fb703729a77311146e0cd3

michalkuligowski · 2025-10-13T08:38:20Z

/run-gaudi-tests

github-actions · 2025-10-13T12:05:03Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
da3fa78dc98f3001e5fb703729a77311146e0cd3

michalkuligowski · 2025-10-14T10:28:24Z

/run-gaudi-tests

Cherry-pick of #359 --------- Signed-off-by: Tianmu Li <[email protected]> Co-authored-by: Michał Kuligowski <[email protected]>

…-project#360) Cherry-pick of vllm-project#359 --------- Signed-off-by: Tianmu Li <[email protected]> Co-authored-by: Michał Kuligowski <[email protected]>

approved pre-maturily

piotrbocian · 2025-10-14T15:42:05Z

I propose to merge to 0.11 and main branches, and cherry-pick to 0.10 if no new issues detected.
This change is fix for https://jira.habana-labs.com/browse/SW-241672.

pawel-olejniczak · 2025-10-22T11:55:51Z

Will this change be merged into v0.10.2_next?
@tianmu-li @piotrbocian

pawel-olejniczak · 2025-10-22T14:22:48Z

/run-gaudi-tests

sys-hab-pt-service · 2025-10-22T14:23:11Z

Only codeowners and testowners can request to run Gaudi tests. Contact list: kzawora-intel, xuechendi, mswiniarsk, adobrzyn, mgawarkiewicz-intel, vivekgoe, afierka-intel, michalkuligowski, iboiko-habana, PatrykWo, kamil-kaczor, kfojcik-intel, ksmusz, wuxun-zhang, xuechendi, attafosu, ulivne, Kacper-Pietkun, iboiko-habana, jkaniecki

PatrykWo · 2025-10-22T15:50:34Z

@piotrbocian the fix is for blocker, Please approve.

PatrykWo · 2025-10-22T15:50:44Z

/run-gaudi-tests

github-actions · 2025-10-22T18:32:02Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
01efc7ef781391e744ed08c3292817a773d654e6

…-project#360) Cherry-pick of vllm-project#359 --------- Signed-off-by: Tianmu Li <[email protected]> Co-authored-by: Michał Kuligowski <[email protected]>

Fix issue with async_scheduling when dealing with chunked input

09c183d

Signed-off-by: Tianmu Li <[email protected]>

tianmu-li force-pushed the async_scheduling_chunk_fix branch from 1f87a51 to 09c183d Compare October 8, 2025 18:32

tianmu-li marked this pull request as ready for review October 8, 2025 18:42

tianmu-li requested review from mgawarkiewicz-intel, piotrbocian and wpyszka as code owners October 8, 2025 18:42

tianmu-li mentioned this pull request Oct 8, 2025

Fix issue with async_scheduling when dealing with chunked input #360

Merged

tianmu-li changed the title ~~[WIP] Fix issue with async_scheduling when dealing with chunked input~~ Fix issue with async_scheduling when dealing with chunked input Oct 8, 2025

tianmu-li added 2 commits October 8, 2025 22:33

Dummy commit

5e339b1

Signed-off-by: Tianmu Li <[email protected]>

Clarify invalid_req_indices

68a272e

Signed-off-by: Tianmu Li <[email protected]>

michalkuligowski approved these changes Oct 9, 2025

View reviewed changes

Merge branch 'v0.10.2_next' into async_scheduling_chunk_fix

d2c88fe

Merge branch 'v0.10.2_next' into async_scheduling_chunk_fix

828401e

Merge branch 'v0.10.2_next' into async_scheduling_chunk_fix

6530113

michalkuligowski added a commit that referenced this pull request Oct 14, 2025

Fix issue with async_scheduling when dealing with chunked input (#360)

8c08770

Cherry-pick of #359 --------- Signed-off-by: Tianmu Li <[email protected]> Co-authored-by: Michał Kuligowski <[email protected]>

piotrbocian previously approved these changes Oct 14, 2025

View reviewed changes

Merge branch 'v0.10.2_next' into async_scheduling_chunk_fix

e8724b1

piotrbocian self-requested a review October 23, 2025 02:32

piotrbocian approved these changes Oct 23, 2025

View reviewed changes

piotrbocian merged commit 4a8c529 into vllm-project:v0.10.2_next Oct 23, 2025
32 of 33 checks passed

Fix issue with async_scheduling when dealing with chunked input #359

Fix issue with async_scheduling when dealing with chunked input #359

Uh oh!

Conversation

tianmu-li commented Oct 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 8, 2025

🚧 CI Blocked

Uh oh!

github-actions bot commented Oct 8, 2025

🚧 CI Blocked

Uh oh!

tianmu-li commented Oct 9, 2025

Uh oh!

afierka-intel commented Oct 9, 2025

Uh oh!

michalkuligowski commented Oct 9, 2025

Uh oh!

michalkuligowski left a comment

Choose a reason for hiding this comment

Uh oh!

michalkuligowski commented Oct 9, 2025

Uh oh!

github-actions bot commented Oct 9, 2025

✅ CI Passed

Uh oh!

michalkuligowski commented Oct 10, 2025

Uh oh!

michalkuligowski commented Oct 13, 2025

Uh oh!

github-actions bot commented Oct 13, 2025

✅ CI Passed

Uh oh!

michalkuligowski commented Oct 13, 2025

Uh oh!

github-actions bot commented Oct 13, 2025

✅ CI Passed

Uh oh!

michalkuligowski commented Oct 14, 2025

Uh oh!

piotrbocian commented Oct 14, 2025

Uh oh!

pawel-olejniczak commented Oct 22, 2025

Uh oh!

pawel-olejniczak commented Oct 22, 2025

Uh oh!

sys-hab-pt-service commented Oct 22, 2025

Uh oh!

PatrykWo commented Oct 22, 2025

Uh oh!

PatrykWo commented Oct 22, 2025

Uh oh!

github-actions bot commented Oct 22, 2025

✅ CI Passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

tianmu-li commented Oct 8, 2025 •

edited

Loading